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The gene encoding the human TLS protein, also 
termed FUS, is located at the site of chromosomal trans- 
locations in human leukemias and sarcomas where it 
forms a chimeric fusion gene with one of several differ- 
ent genes. To identify interacting partners of TLS, we 
screened a yeast two-hybrid cDNA library constructed 
from mouse hematopoietic cells using the C-terminal 
region of TLS in the bait plasmid. Two cDNAs encoding 
members of the serine-arginine (SR) family of proteins 
were isolated. The first SR protein is the mouse homolog 
of human splicing factor SC35, and the second SR mem- 
ber is a novel 183-amino acid protein that we term TASR 
(TLS-associated serine-arginine protein). cDNA cloning 
of human TASR indicated that mouse and human TASR 
have identical amino acid sequences. The interactions 
between TLS and these two SR proteins were confirmed 
by co-transfection and immunoprecipitation studies. In 
vivo splicing assays indicated that SC35 and TASR influ- 
ence splice site selection of adenovirus E1A pre-mRNA* 
TLS may recruit SR splicing factors to specific target 
genes through interaction with its C-terminal region, 
and chromosomal translocations that truncate the C- 
terminal region of TLS may prevent this interaction. 
Thus TLS translocations may alter RNA processing and 
play a role in malignant transformation. 



Chromosomal translocations are found frequently in leuke- 
mias as well as in malignancies of non-hematopoietic tissues. 
These translocations usually give rise to novel fusion genes and 
novel fusion proteins. To understand the role that these fusion 
proteins play in cellular transformation, knowledge of the func- 
tion of the wild- type protein involved in the translocations is 
required. 

We have focused these studies on understanding the function 
of the TLS gene. TLS (translocated in liposarcoma), also called 
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FUS, was originally identified through its fusion to the CHOP 
gene (a member of the C/EBP family of transcription factors) in 
human myxoid liposarcoma with the t(12;16) chromosomal 
translocation (1, 2). In human acute myeloid leukemia with the 
t(16;21) translocation, the TLS gene is fused with the ERG 
gene (a member of the ETS family of transcription factors) (3). 
In translocations involving TLS, the N- terminal part of TLS is 
retained in the fusion protein and the C-terminal region is 
deleted. 

TLS is a member of a closely related family of genes, includ- 
ing the EWS gene, which was originally identified in Ewing's 
sarcoma (4). Both TLS and EWS are involved in various types 
of cancers through chromosomal translocation either with 
other genes of the ETS family or with other transcription 
factors (see Ref. 5 for a review). The evidence that transcrip- 
tional activation plays a role in transformation mediated by 
TLS (or EWS) fusion proteins stems from the observations that 
the N terminus of the TLS protein is rich in glutamine, serine, 
and tyrosine and is a potent transactivator when fused with 
various transcription factors (6). 

Two observations suggest that the transactivational activity 
of the TLS fusion protein may not be sufficient to explain the 
role of the fusion protein in transformation. First, a correlation 
between transactivation and transformation by these fusion 
proteins has not been demonstrated (7). Second, the C terminus 
of TLS contains several motifs that are suggestive of other 
potential TLS functions. The TLS C-terminal region contains 
an evolutionarily conserved RNP 1 consensus sequence and 
Arg-Gly-Gly (RGG) repeats, both of which have been implicated 
in RNA binding (8). In previous studies, TLS has been shown to 
bind to RNA in both the nucleus and the cytoplasm and was 
hypothesized to be a heterogeneous ribonuclear protein-like 
chaperone of RNA (9). In a recent study, TLS was found to 
interact with PU.l, an ETS protein capable of regulating tran- 
scription and RNA splicing. Overexpression of TLS in IW1^32 
erythroid cells was shown to promote the use of the distal 
5'-splice site during E1A pre-mRNA splicing. This site prefer- 
ence is counterpoised by PU.l, suggesting that TLS may be 
part of a protein network involved in the regulation of RNA 
processing (10). 

Because the C terminus of TLS is replaced in fusion proteins 
as the result of chromosomal translocation, loss of TLS inter- 
actions with cellular regulators may play an important role in 
transformation. To identify protein molecules that interact 
with TLS, we used the C-terminal part of TLS protein as the 
bait to screen a yeast two-hybrid library derived from mouse 
multipotential hematopoietic cells (11). We isolated two cDNAs 
that encode TLS-interacting proteins with serine-arginine 
(SR)-rich domains. The first cDNA corresponds to the mouse 
homolog of the human splicing factor SC35, and the second 
cDNA encodes a novel protein that we term TASR (TLS-asso- 
ciated serine-arginine protein). The SR family of proteins has 
been shown previously to be involved in constitutive and reg- 
ulated RNA splicing. 



1 The abbreviations used are: RNP, ribonucleoprotein; SR, serine- 
arginine; PCR, polymerase chain reaction; EML, erythroid, myeloid, 
and lymphoid; RT, reverse transcription; PAGE, polyacrylamide gel 
electrophoresis; DOTAP, iV-[dioleoyloxy]propyl-trimethylammonium 
methylsulfate. 
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EXPERIMENTAL PROCEDURES 

Plasmid and Library Constructions — To generate a TLS bait pi asm id 
for yeast two-hybrid screening, human TLS cDNA was PCR-amplified, 
digested with Eco RI, and cloned into the EcoRl-Smal sites of the 
pBTM116 vector (12) to generate the bait plasmid pBTM-TLS in which 
the C -terminal region of TLS protein is fused to LexA DN A- binding 
protein (see Fig. 1). For expression of Flag- tagged TLS protein in COS-7 
cells, the entire coding region of the TLS cDNA was subcloned in-frame 
into the EcoRl-Smal sites of pSG5-Flag vector (CLONTECH) to gener- 
ate pSG5-Flag-TLS. Flag-tagged luciferase expression plasmid pSG5- 
Flag-Luc was constructed by ligation of the pSG5-Flag vector with the 
luciferase insert from pGL3 plasmid (Promega). Myc-tagged expression 
constructs pCS2-MT-SC35 and pCS2-MT-TASR were generated by in- 
frame ligation of mouse SC35 and TASR coding sequences into the 
EcoRl-Stul sites of pCS2-MT vector. For overexpression of the SC35 
and TASR proteins in HeLa cells, full-length mouse SC35 and TASR 
cDNAs were cloned into the EcoRl-EcoKV sites of pCR3 vector (Invitro- 
gen) to generate plasmids pCR3-SC35 and pCR3-TASR. Plasmid pCS3- 
MT-E1A was from Dr. Moreau-Gachelin and described previously (10). 
The yeast two-hybrid library was constructed with mRNAs from the 
murine hematopoietic cells with erythroid, myeloid, and lymphoid 
(EML) potential (11, 12). 

Two-hybrid Screen — The yeast two-hybrid screen was performed as 
described previously (1 1). 

cDNA Cloning— To obtain full-length mouse SC35 and TASR cDNA 
sequences, the SC35 and TASR inserts obtained using the yeast two- 
hybrid screen were used as the probe in the hybridization screening of 
a Uni-ZAP phage cDNA library derived from EML cells. pBluescript 
phagemids containing full-length SC35 and TASR cDNAs were pre- 
pared after in vivo excision from the Uni-ZAP XR vector and were used 
in sequencing reactions with Dye terminators (Applied Biosystems). To 
obtain human TASR cDNA, total RNA was isolated from K562 leuke- 
mia cells and used as a template for amplification by RT-PCR. 

Immunoprecipitation and Western Blot Analysis — For expression of 
Flag- or Myc-tagged proteins, 10 /ig of pSG5- Flag-expression construct 
and 10 fig of pCS2-Myc-expression construct were introduced into 3 X 
10 6 COS-7 cells by electropo ration. 48 h after electroporation, the cells 
were lysed with 0.6 ml of lysis buffer A (10 mM Tris-HCl, pH 7.4, 2.5 mM 
MgCl 2 , 100 mM NaCl, 0.5% Triton X-100). 3 fd of polyclonal rabbit 
anti-Flag D8 antibody (Santa Cruz Biotechnology) or 3 jul of monoclonal 
mouse anti-Myc 9E10 antibody (Sigma) was first incubated with 30 ^tl 
of protein A/G agarose (Santa Cruz Biotechnology) for 50 min at 4 °C in 
0.3 ml of buffer A, and the antibody-protein A/G- agarose complex was 
then incubated with 0.2 ml of Iysate for 20 min at 4 °C with gentle 
rocking. After washing with radioimmune precipitation buffer 4 times, 
50 /xl of SDS-PAGE sample buffer was added to the agarose beads. The 
samples were heated at 100 °C for 3 min, 20 fi\ of the sample was 
separated by SDS-PAGE in a 10% gel, and the proteins were detected 
with a monoclonal mouse anti-Flag M2 antibody (Sigma) or a mono- 
clonal mouse anti-Myc 9E10 antibody as described under "Results and 
Discussion." Protein bands were visualized using the ECL Western 
blotting analysis system (Amersham Pharmacia Biotech). 

In Vivo Splicing Assay — For transient transfection of HeLa cells, 1.7 
fig of pCS3-MT-ElA and 1.7 fig of pCR3-construct plus 1.7 fig of pSG5- 
Flag-construct were mixed with 30 fA of DOTAP (Boehringer Mann- 
heim), and the DNA- DOTAP mixture was added to a 60- mm dish of 75% 
confluent HeLa cells according to the manufacturer's instructions. 40 h 
after addition of the DNA-DOTAP mixture, total RNA was purified 
from transfected HeLa cells with an RNeasy Mini Kit (Qiagen). RT-PCR 
amplification of various E1A isoforms was carried out as described 
previously (13) with 5 ' G AGCTTGGGCG ACCTC A3 ' (RR67) as the for- 
ward primer and 5 ' AATACGACTC ACTATAG3 ' (T7) as the reverse 
primer. 

RESULTS AND DISCUSSION 

The structural features of TLS, fusion proteins TLS/CHOP 
and TLS/ERG, and the C-terminal region of TLS used in the 
bait plasmid are shown (Fig. 1). TLS sequence features impli- 
cated in RNA binding include the Arg-Gly-Gly repeats (RGG) 
and the evolutionarily conserved ribonucleoprotein consensus 
sequence (RNP-CS). 

In the initial yeast two-hybrid screen of 1 X 10 7 transfor- 
mants, we identified two independent clones that were positive 
for interaction with the TLS bait. Sequencing of the first clone 
indicated that it was the mouse homolog of human SC35, one of 
the SR-rich mammalian splicing factors (14). Sequencing of the 
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Fig. 1. Diagrams of TLS and TLS fusion proteins. Normal TLS 
protein is shown with distinct sequence features. QSY f glutamine-, 
serine-, and tyrosine-rich domain; RGG, regions with multiple Arg-Gly- 
Gly repeats; RNP-CS, ribonucleoprotein consensus sequence. Sites of 
TLS/CHOP and TLS/ERG fusions are indicated. The TLS region used 
as a bait in the yeast two-hybrid screen is indicated. 
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Fig. 2. Amino acid sequence of TASR and its structural com- 
parison to SC35. A, the predicted amino acid sequence of TASR 
protein. Both mouse TASR (GenBank accession no. AF042383) and 
human TASR (GenBank accession no. AF047448) are identical at the 
amino acid level. RNP2 and RNPl consensus sequences are underlined, 
Arg-Ser or Ser-Arg di pep tide repeats are shown in boldface. B, sche- 
matic comparison of TASR to SC35. The RNP consensus sequences are 
shown in gray boxes. The glycine-rich hinge is in a hatched box, and the 
RS domains are in boxes with wavy lines. 



second positive clone revealed the presence of C-terminal ser- 
ine-arginine repeats, whereas the remainder of the sequence 
did not align with sequences in the GenBank data base. In- 
frame translation of the cDNA insert indicated that it repre- 
sented a novel TLS-associated protein containing multiple ser- 
ine-arginine repeats; therefore the protein was named TASR 
(TLS-associated protein with serine-arginine repeats). 

To obtain full-length mouse SC35 and TASR cDNAs, the 
corresponding cDNA inserts were used to screen an EML cDNA 
phage library. Several clones containing full-length SC35 and 
TASR cDNAs were isolated. Nucleotide sequence analysis re- 
vealed that mouse SC35 cDNA is approximately 1.9 kilobases 
in length (GenBank accession no. AF077858) and encodes a 
protein nearly identical to human SC35. Mouse TASR cDNA is 
approximately 3.4 kilobases in length (GenBank accession no. 
AF042383). The sequence surrounding the first ATG codon of 
mouse TASR cDNA is an excellent match to the Kozak trans- 
lation initiation consensus (15). The deduced amino acid se- 
quence of the TASR open reading frame corresponds to a pro- 
tein of 183 amino acids with a calculated molecular mass of 22 
kDa (Fig. 2A). Prominent in the TASR sequence are RNP2 and 
RNPl motifs, which are signatures of RNA-binding proteins, as 
well as multiple serine-arginine repeats, which are a charac- 
teristic feature of RNA splicing factors. Even though a search of 
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Fig. 3. Association of SC35 and TASK with TLS in COS-7 cells. 

A, Plasmid expressing Myc-tagged SC35 protein was co-transfected into 
COS-7 cells with plasmid expressing Flag-tagged TLS or Flag-tagged 
luciferase. The cell lysates were i mm uno precipitated with a polyclonal 
rabbit an ti- Flag antibody (lanes 1 and 2) or with a normal rabbit IgG 
(lane 3). The immunoprecipitates (IP) were isolated, subjected to gel 
electrophoresis, and immunoblotted with a monoclonal mouse anti-Myc 
antibody or a mouse monoclonal anti-Flag antibody. The reciprocal 
immunoprecipitation was carried out with a monoclonal mouse anti- 
Myc antibody (lanes 4 and 5) or a normal mouse IgG (lane 6). The 
precipitated proteins were separated and immunoblotted with a mono- 
"clonal mouse anti-Flag or a monoclonal mouse anti-Myc antibody. B t 
experiments as in A were carried out using Myc-tagged TASR. 

the GenBank protein data base with the BLASTp program (16) 
revealed that the TASR protein shares similar structural mo- 
tifs with the SR family of splicing factors such as SC35, SF2/ 
ASF (17, 18), and SRp20 (19), the conserved glycine-rich hinge 
in SC35 and SF2/ASF is absent in TASR protein (Fig. 2B). 

Using RT-PCR, we have also cloned the human TASR cDNA 
(GenBank accession no. AF047448) from K562 leukemia cells. 
Sequencing analysis revealed that both mouse and human 
TASR coding regions are 93% identical at the nucleotide level 
and 100% identical at the amino acid level, indicating that 
TASR protein is evolutionariiy conserved. 

The finding that SC35 interacts with TLS in the yeast two- 
hybrid system suggested that these two proteins may also 
associate with each other in mammalian cells. To confirm their 
intracellular association, plasmids expressing Myc-SC35 and 
Flag-TLS or Flag-luciferase were co-transfected into COS-7 
cells, and lysates from the co-transfected cells were used for 
immunoprecipitation. A polyclonal rabbit anti-Flag antibody 
co-immunoprecipitated Myc-SC35 and Flag-TLS (Fig. 3A, lane 
1) but did not co-immunoprecipitate Myc-SC35 along with 
Flag-luciferase (Fig. 3A, lane 2). A normal rabbit IgG did not 
co-immunoprecipitate Myc-SC35 and Flag-TLS (Fig. 3A, lane 
3), indicating the specificity of the protein-antibody interaction. 
In the reciprocal immunoprecipitation, only Flag-TLS was co- 
immunoprecipitated with Myc-SC35 by a monoclonal mouse 
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FIG. 4. Effects of TLS, SC35, and TASR overexpression on E1A 
pre-mRNA splicing in HeLa cells. A, diagrams of E1A pre-mRNA 
splicing. Various E1A splicing isoforms are shown with dashed lines 
representing spliced sequences. Primers for RT-PCR analysis are indi- 
cated by arrows. B, in vivo alternative splicing of E1A pre-mRNA in 
HeLa cells. DNA combinations for all samples are indicated at the top. 
The total amount of DNA in each transfection was kept constant 
through addition of control plasmids. Different E1A splicing isoforms 
were amplified by RT-PCR, and all E1A bands were confirmed by 
hybridization to a 32 P-labeled E1A DNA probe. 

anti-Myc antibody (Fig. 3A, lane 4). Flag-luciferase failed to 
associate with Myc-SC35 and could not be detected in the 
immunoprecipitates (Fig. 3A, lane 5). When a normal mouse 
IgG was used in the experiment, neither Flag-TLS nor Myc- 
SC35 was present in the immunoprecipitates (Fig. 3A, lane 6). 

To confirm the intracellular association between TASR and 
TLS, plasmids expressing Myc-TASR and Flag-TLS or Flag- 
luciferase were also transfected into COS-7 cells for co-immu- 
noprecipitation studies. Myc-TASR and Flag-TLS formed a pro- 
tein complex that is recognized by specific antibodies directed 
against the epitope tags (Fig. 3B, lanes 1 and 4). Myc-TASR did 
not bind to Flag-luciferase (Fig. 3B, lanes 2 and 5), indicating 
the specificity of the interaction between TLS and TASR. Nor- 
mal IgG did not immunoprecipitate Myc-TASR or Flag-TLS 
(Fig. 3£, lanes 3 and 6). 

To investigate the functions of TLS, SC35, and TASR as 
splicing factors, we analyzed the effects of their overexpression 
on the alternative splicing of the adenovirus E1A pre-mRNA in 
HeLa cells. Differential splicing of E1A pre-mRNA generates 
five major isoforms (13, 12, 11, 10, and 9 S) (Fig. 4A) (20). When 
overexpressed in HeLa cells, TLS promoted splice site selection 
generating the 13 and 12 S isoforms (Fig. 4B, compare lanes 1 
and 2). In agreement with previous reports, expression of SC35 
did not increase 13 S isoform but decreased the 12 S isoform 
(Fig. 4B, compare lanes 1 and 3) (21). When both TLS and SC35 
were co-expressed, TLS was unable to promote the use of splice 
sites leading to 13 and 12 S isoforms (Fig. 4B, compare lanes 2 
and 4). The TASR protein also functioned as a splicing factor. 
In contrast to SC35, however, TASR expression decreased the 
13 S isoform and promoted the use of distal 5'-splice sites 
leading to the 11, 10, and 9 S isoforms (Fig. 4£, compare lanes 
1 and 5). When TLS and TASR were co-expressed, again the 
distal 5 '-splice sites resulting from TASR selection were fa- 
vored (Fig. 4B, compare lanes 2 and 6). Together, these results 
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indicate that SC35 and TASR display opposing effects on E1A 
pre-mRNA splicing and that the influence of TLS on El A 
pre-mRNA splicing is abrogated in the presence of overex- 
pressed SC35 or TASR. 

Considerable evidence now implicates TLS in binding to 
RNA. First, TLS protein contains structural motifs such as 
RGG and RNP-CS, which are conserved in many RNA-binding 
proteins (8). Second, bacterially expressed full-length TLS and 
its C terminus both bind to in vivo labeled RNA (1). The fact 
that this binding is enriched in the poly(A) + fraction of RNA 
suggests that the targets for TLS are in the mRNA fraction. 
Third, in vivo UV cross-linking studies indicate that TLS binds 
to RNA species that turn over rapidly inside cells (9). Fourth, 
immunocytochemical analysis localizes TLS protein to nuclear 
regions outside of the nucleolus, further supporting the notion 
that TLS binds to non-ribosomal RNA transcripts (1, 22). 

The evidence linking TLS to splicing of nascent pre-mRNA is 
also accumulating. TLS has been found to be associated with 
proteins, such as human nuclear RNP Al and SF1, that are 
implicated in splicing (23, 24). Also, as mentioned in the intro- 
duction, TLS induces the preferential use of the most distal 
5'-splice site during E1A pre-mRNA splicing in IW1-32 eryth- 
roid cells (10). At the present time, the mechanism by which 
TLS influences pre-mRNA splicing remains unclear. This 
study, for the first time, links TLS to the SR family of proteins, 
which have been extensively investigated for their role in RNA 
splicing. 

RNA processing is clearly a primary function of SR proteins. 
The N-terminal RNP consensus sequence of SR proteins is 
required for binding to RNA, whereas the C-terminal arginine- 
serine-rich domain is important in mediating association with 
other cellular proteins (25). In addition, phosphorylation of SR 
proteins has also been reported to affect both protein-protein 
and protein-RNA interactions, thus exerting further control 
over the enzymatic activity and intracellular localization of 
these splicing factors (26, 27). Although different SR proteins 
may have overlapping functions in constitutive splicing, they 
differ from each other in their ability to regulate alternative 
splicing as shown by previous studies (21, 28) and by the in vivo 
splicing assays of this paper. 

Important regulatory roles for SR proteins have been dem- 
onstrated by their functions in Drosophila development and 
sex determination (29-31), their involvement in T-cell activa- 
tion (32), their close association with cell cycle control (33), and 
their interaction with key cellular enzymes such as DNA topoi- 
somerase I (34). SR proteins have been found to be associated 
with various transcription units (35), linking SR proteins to the 
tightly coupled processes of transcription and splicing in mam- 
malian cells (36). The association of the SR family of splicing 
factors with RNA polymerase II has also been described by 
several groups (37-39), suggesting interactions between SR 
proteins and the C-terminal repeat domain of RNA polymerase 
II. 

Because of the potential functions of TLS and SR proteins, 
interactions between the SR family of proteins and TLS-related 
proteins may be an integral part of the control of normal cell 
growth and differentiation. Because our yeast two-hybrid 
screen suggests that the C-terminal region of TLS is involved in 
binding to SR proteins, interactions between TLS and SR pro- 
teins are likely to be altered or disrupted when this part of TLS 
protein is replaced through chromosomal translocations. The 



role that the TLS fusion proteins play in the TLS-SR protein 
interaction and their effect on RNA processing of TLS target 
genes are important questions for further study. Alterations of 
RNA splicing for several genes are associated with various 
tumor types (40). This study thus suggests that truncation of 
the TLS protein by translocation may result in the alteration or 
loss of critical RNA processing capability, leading cells on the 
path to malignant transformation. 
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FH Key Location/Qualifiers 
FH 

FT source 1. .620 

FT /db„xref ="taxon: 9606" 

FT /organism="Homo sapiens" 

FT /eel 1_1 ine = " K5 6 2 " 

FT / 1 i s su e_ t ype = " 1 eukemi a " 

FT CDS 30. .581 

FT /codon_start=l 

FT /db_xref =" SPTREMBL : 060572 " 

FT /note="has RNP motif and arginine/serine-rich (RS) domain' 

FT /product = "TLS -associated protein TASR" 

FT /protein_id="AAC70918 .1" 

FT / 1 rans 1 at ion= " MSRYLRPPNTSLFVRNVADDTRSEDLRREFGRYGPI VDVYVPLr 

FT YTRRPRGFAYVQFED VRDAEDALHNLDRKW I CGRQ I E I QFAQGDRKTPNQMKAKEGRfc 

FT YSSSRYDDYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDI^ 

FT RPNCSWNTQYSSAYYTSRKI 11 

SQ Sequence 620 BP; 185 A; 135 C; 158 G; 142 T; 0 Other; 

SCORES Initl: 24d0 Initn: 2991 Opt: 3054 z-score: 1064.8 E(): 0 



99.5% identity ih 621 bp overlap 



40 50 60 70 80 90 



;d^^GGTTTGCTGAGCCCGTTAGTGCGC^^G< 



Sa2 47851 002 CTGAGCCGCd^PGGTTTGCTGAGCCCGTTAGTGCGCWGGCCGAGACACGCCGCCGCCA 

" I I I I I I I I I I I M I I I I I I ! I ! I I I I I I 
Af047448 AGTGCGCCCGGCCGAGACACGCCGCCGCCA 

10 20 30 

100 110 120 130 140 150 

Sa247851 002 TGTCCCGCTACCTGCGTCCCCCCAACACGTCTCTGTTCGTCAGGAACGTGGCCGACGACA 

1 1 1 1 1 1 1 f 1 1 1 1 1 4 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Af047448 TGTCCCGCTACCTGCGTCCCCCCAACACGTCTCTGTTCGTCAGGAACGTGGCCGACGACA 

40 50 60 70 80 90 

160 170 180 190 200 210 

Sa2 47851 002 CCAGGTCTGAAGACTTGCGGCGTGAATTTGGTCGTTATGGTCCTATAGTTGATGTGTATG 

I ! I I! Mill III II I MillMI II II II II I! IIIIIEIIMM II II I I I! Ml Ml 

Af047448 CCAGGTCTGAAGACTTGCGGCGTGAATTTGGTCGTTATGGTCCTATAGTTGATGTGTATG 

100 110 120 130 140 150 

220 230 240 250 260 270 

Sa2 47851 002 TTCCACTTGATTTCTACACTCGCCGTCCAAGAGGATTTGCTTATGTTCAATTTGAGGATG 
I II II I I I II I I I I I I II I II I I I II I I II I I I I M I M I I I I I I I II I I II I I II II II 

Af 0 4 7 4 4 8 TTCCACTTGATTTCTACACTCGCCGTCCAAGAGGATTTGCTTATGTTCAATTTGAGGATG 

160 170 180 190 200 210 

280 290 300 310 320 330 

Sa247851 002 TTCGTGATGCTGAAGACGCTTTACATAATTTGGACAGAAAGTGGATTTGTGGACGGCAGA 
I II II I I I I I I II I I I I I II I I I II I I I II I I I M I I I I I I I I II II I II II I I II I II I 
Af047448 TTCGTGATGCTGAAGACGCTTTACATAATTTGGACAGAAAGTGGATTTGTGGACGGCAGA 

220 230 240 250 260 270 

340 350 360 370 380 390 

Sa247851_002 TTGAAATACAGTTTGCCCAGGGGGATCGAAAGACACCAAATCAGATGAAAGCCAAGGAAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml I MM I Ml 1 1 1 Ml IN 

Af 047448 TTGAAATACAGTTTGCCCAGGGGGATCGAAAGACACCAAATCAGATGAAAGCCAAGGAAG 

280 290 300 310 320 330 

400 410 420 430 440 450 

Sa247851 002 GGAGGAATGTGTACAGTTCTTCACGCTATGATGATTATGACAGATACAGACGTTCTAGAA 

I M I I II I I I II I II I I I II I I I I M I I M I I M I I I I I II I I I III I M I I II I I II II 
Af 047448 GGAGGAATGTGTACAGTTCTTCACGCTATGATGATTATGACAGATACAGACGTTCTAGAA 

340 350 360 370 380 390 

460 470 480 490 500 510 

Sa247851 002 GCCGAAGTTATGAAAGGAGGAGATCAAGAAGTCGGTCTTTTGATTACAACTATAGAAGAT 

II II I I I I II II II I I I II II I I I I II I I I I M I M II M I I I I II I I II I II I II I I I I 
Af 04 7 44 8 GCCGAAGTTATCAAAGGAGGAGATCAAGAAGTCGGTCTTTTGATTACAACTATAGAAGA^ 

400 410 420 430 440 450 

520 530 540 550 560 570 

Sa247851 002 CGTATAGTCCTAGAAACAGTAGACCGACTGGAAGACCACGGCGTAG - AGAAGCCATTCCG 
I I I I I I I I I I I I II I I I II II I II I II M I I II II I I I I II I II I I I I I M I I I I I I I 
Af 047448 MtATAGTCCTAGAAACAGTAGAC^ 

460.. 470 480 490 500 510 



580 590 600 610 620 630 

sa24785i_002 acaatgatagacc^^ctgcagctggaatacccagtacagttctgcttactacacttcaa 



iSl 1 1 M 1 1 I M 1 1 1 1 1 1 II I 1 1 1 1 1 A 



1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 

Af047448 ATAATCATAGa7:CAAACTGC^ 

520 530 540 550 560 570 

640 650 660 670 680 690 

Sa247851 002 GAAAGATCTGAAAGCGGAAAAAGAACCAAAGAAGGGCAGTTCAAGCGACCAAAGGGTGGG 

MINIM Mill MINIM MM III I II II II II II II II I II 1 1 1 1 

Af 04 744 8 GAAAGATCTGAAAGCGGAAAAAGAACCAAAGAA - GGCAGTTCAAGCGACCA 

580 590 600 610 620 

700 710 720 730 740 750 

S a2 4 7 8 5 1_0 0 2 TGGAAGGTGCTGCAGTATGAATACTGTACGAATATTTTGACTCTGGTCTGAAAAGATAAA 



I 
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KW 
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OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; Euthena; 

OC Primates; Catarrhini; Hominidae; Homo. 

RN [1] 

RP 1-620 

RX MEDLINE; 98447613. 

RA Yang L., Embree L.J., Tsai S., Hickstein D.D.; t 

RT "Oncoprotein TLS interacts with serine- arginine proteins involved m RNA 

RT splicing" ; 

RL J. Biol. Chem. 273 (43) : 27761-27764 (1998) . 

RN [2] 

RP 1-620 

RA Yang L., Embree L., Tsai S., Hickstem D.D.; 
RT 

RL Submitted (10-FEB-1998) to the EMBL/GenBank/DDBJ databases. 

RL Medicine/Oncology, University of Washington, 1660 S. Columbxan Way, GMR 

RL 151, Seattle, WA 98108, USA 

DR SPTREMBL; 060572; 060572. 

FH Key Location/Qualif iers 

FH 

FT source 1. .620 

FT /db_xref= l, taxon:9606" 

ft /organism="Homo sapiens" 

ft /cell_line="K562" 

FT /tissue_type=" leukemia" 

FT CDS 30. .581 

ft /codon_start=l 

FT /db_xref=" SPTREMBL: 060572" . 

ft /note="has RNP motif and arginine/serine-nch (RS) domain" 

ft /product="TLS-associated protein TASR" 

ft /protein id="AAC709l8 . 1" 

FT / trans 1 at ion= "MSRYLRPPNTSLFVRNVADDTRSEDLRREFGRYGPIVDVYVPLD 

Ft YTRRPRGFAYVQFEDVRDAEDALHNLDRKW I CGRQI E IQFAQGDRKTPNQMKAKEGRIT 

FT YSSSRYDDYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNi 

FT RPNCSWNTQYS SAYYTSRKI " 

SQ Sequence 620 BP; 185 A; 135 C; 158 G; 142 T; 0 other; 

SCORES (3) Initl: 1077 Initn: 1077 Opt: 1077 z-score: 749.5 E(): 0 
81.7% identity in 197,aa overlap 

10 20 30 40 50 

Sa24 7 8 5 1_0 0 0 MSRYLRPPNTSLFVRNVAppTRSEpLRREFGRYGPIVpyYVPLD 



I MM II 1 1 1 M Ml IN I M 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 M 1 1 1 M I M I 



IHAAA^^LRPPNTSLFVRNVADDTRSEDLR^^3 



Af 04 744 8 C^PRHAAA^ptLRPPNTSLFVR^ADDTRSEDLRRWGRYGPIVDVYVPLDFYTRRPR 

10 20 30 40 50 60 

60 70 80 90 100 110 

Sa247851 000 GFAYVQFEDVRDAEDALHNLDRKWI CGRQ I E I QFAQGDRKTPNQMKAKEGRNVYS S SRYD 

1 1 1 Mil II I IMIIIIIMII I II II! II II I II 

Af 047448 GFAYVQ FED VRDAEDALHNLDRKWI CGRQ I E I QFAQGDRKTPNQMKAKEGRNVYS S SRYD 

70 80 90 100 110 120 

120 130 140 150 160 170 

Sa247851 000 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRREAIPTMIDQTAAGIP 

I III I Mill I I I IMIMI II I II III I IMIIII IIIMMI : ■ 

Af 047448 DYDRYRRSRSRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRSRSHSDNDRPNCSWNT 

130 140 150 160 170 180 

180 190 200 210 

S a2 4 7 8 5 1 0 0 0 STVLLTTLQERSESGKRTKEGQFKRPKGGWKVLQYEYCTNI LTLV 

: : : : : | : : : : | | | | | 

Af 04 744 8 QYSSAYYTSRKIXKRKKNQRRQFKRP 

190 200 




Sa247851_0001.Prot 
Trembl : 060572 

ID 060572 PRELIMINARY; PRT; 183 AA. 



AC 060572; 

DT 01-AUG-1998 (TrEMBLrel . 07, Created) 

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update) 

DT 01-NOV-1999 (TrEMBLrel. 12, Last annotation update) 

DE TLS -ASSOCIATED PROTEIN TASR . 

OS Homo sapiens (Human) . . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 

OC Eutheria; Primates; Catarrhini; Hominidae; Homo. 

RN [1] 

RP SEQUENCE FROM N.A. 

RC T I S SUE = LEUKEM I A ; 

RA YANG L., EMBREE L., TSAI S., HICKSTEIN D.D.; 

RL Submitted (FEB-1998) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF047448; AAC70918:l; -. 

DR HSSP; P09651; 1UP1 . 

DR PFAM; PF00076; rrm; 1. 

SQ SEQUENCE 183 AA; 22222 MW; 070A7D15 CRC32; 



SCORES Initl: 1071 Initn: 1071 Opt: 1071 z-score: 2509.0 E(): 

Smith-Waterman score: 1071; 100.0% identity in 155 aa overlap 

10 20 30 40 50 60 

Sa247851_000 MSRYLRPPNTSLFVRNVADDTRSEDLRREFGRYGPI^ 

I I I I I I I I I I | | | | | | I I I I I I M I I I I I I I I I I I I I I t I I I I I t I 

060572 MSRYLRPPNTSLFVRNVADDTR 

10 20 30 40 50 60 

70 80 90 100 110 120 

Sa2 4 78 5 1_0 0 0 VRDAEDALHNLDRKWI CGRQ I E I QFAQGDRKTPNQMKAKEGRNVYS S S RYDD YDRYRRSR 

| | | | | | | | | | | | I I M I I I I II I I I I I I I I I M I I I I I I I I 1 1 1 1 1 1 1 1 1 M I I I I I I I I 

060572 ^AEDALrai^^ 

70 80 90 100 110 120 

130 140 150 160 170 180 

Sa2 4 7 8 5 1_0 0 0 SRSYERRRSRSRSFDYNYRRSYSPRNSRPTGRPRRREAIPTMIDQTAAGIPSTVLLTTLQ 

060572 SRSYERRRSRSRSFDYNYRRSYSPRNS^ 

130 140 150 160 170 180 

190 200 210 

S a2 4 7 8 5 1_0 0 0 ERSE SGKRTKEGQFKRPKGGWKVLQ YE YCTNI LTLV 
060572 RKI 



